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Abstract 



A new strategy for global geometry optimization of clusters is presented. 
Important features are a restriction of search space to favorable nearest- 
neighbor distance ranges, a suitable cluster growth representation with di- 
minished correlations, and easy transferability of the results to larger clusters. 
The strengths and possible limitations of the method are demonstrated for 
Siio using an empirical potential. 



1 Introduction 



In recent years, many new methods have been developed for the problem 
of cluster geometry optimization (see, e.g., Refs 0, H, 1, H H H 0, P), 
that is, the search for the lowest energy configuration of a cluster of a given 
number of atoms or molecules with a given interaction. Also, attempts are 
being made to discover links between features in model potentials and the 
resulting minimal energy structures |10[ . Nevertheless, the following difficul- 
ties persist and are not fully solved (or, partly, not even addressed) by the 
above methods: 

1. exponential increase of the number of local minima with cluster size 
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2. suitable representation of the problem (e.g., choice of coordinates or 
parameters to be optimized) M, |9|, |T2"|| ; 



3. proper restriction of the search space [0, O, |T| 



4. high expense of ab initio potential energy surface calculation. 

Item (1) is well-known. Much less attention is paid to items (2) and (3), 
although they are just as important as the choice of a good optimization 
algorithm. Finally, the number and location of minima on a potential energy 
surface depends critically on details of the potential energy function used 



lfL5|| . Therefore, ideally, geometry optimizations should be performed at the 
ab initio level of theory; in that case, item (4) is of paramount importance. 

Recently, it has been proven that one cannot hope for discovery of a global 
optimization method that performs well for every conceivable problem and 
problem representation ("no free lunch theorem" [12|]). Hence, one should 
not try to tweak an existing optimization strategy to perform acceptably for 
a given problem, but instead construct a new strategy based upon prior infor- 
mation on the problem. In this Letter, I present GOALS (global optimization 
by optimized assembly of local structure), a strategy specifically designed to 
address problems (l)-(3) mentioned above, by the following features: 

1. transfer of knowledge from small clusters to larger ones; 

2. no direct optimization of simple coordinates of individual atoms (which 
will always exhibit a high degree of correlation); 
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3. all obviously nonsensical cluster structures are avoided by construction. 



Problem (4) will be dealt with in a subsequent paper. 

One ingredient of the new method is a cluster growth strategy. Such 
growth strategies have been used before (see, e.g., Refs |7|, [L6| and Refs cited 
therein, in particular also |T7j]), but only with rather crude ad-hoc methods 
of adding new atoms, followed by local optimization. Here, the different ways 
of growth themselves are subject to optimization. 

It is important to emphasize that this Letter is a presentation of a new 
method, not an investigation into particular cluster structures or into the 
quality of a particular empirical potential. 

This Letter is organized as follows: Section 2 contains the presentation 
of the general idea and of a particular implementation; in section 3, this 
implementation is applied to small silicon clusters; in section 4, limitations 
and further work are discussed. 



2 Method 
2.1 General idea 

The central idea of GOALS is to construct global energy minimum struc- 
tures of large clusters (of Nl atoms) without explicitly subjecting them to 
a global optimization strategy. Instead, structural information from globaly 
optimized smaller clusters (of N c atoms) is used to build up starting ge- 
ometries for the large clusters wich are then only locally optimized. At the 
same time, the problem of coordinates and proper search space restriction is 
avoided by construction, in the following way: 

"Cluster assembly agents" are introduced: Each agent can act on a cluster 
of iV atoms, for arbitrary values of N. It selects a particular site next to the 
cluster (or inside the cluster) according to given criteria, and adds a new 
atom such that a specific geometric arrangement is generated at this site, 
for example, a face capping or a dangling atom. The parameters to be 
optimized are then simply the relative probabilities of these agents. Proper 
design of these agents can approximate a one-to-one relationship between 
agents and local structural features in a cluster. In this way, correlations 
between parameters to be optimized are kept minimal. 
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Optimization of agent probabilities proceeds via a series of assemblies 
of a cluster of N c atoms, starting from a small seed structure of N s atoms. 
More atoms are added by choosing cluster assembly agents according to 
their probabilities, at each stage N s < N < N c ; the potential energy of 
the resulting cluster of N c atoms is minimized by variation of the agent 
probabilities. 

After completion of this stage, the resulting set of optimized cluster as- 
sembly agent probabilities can then be applied to generate good starting 
guesses for traditional, local optimizations of larger clusters (of Nl atoms, 
with Nl > N c ). These local optimizations will arrive at the global energy 
minimum structure(s) with unusually high probability, provided the local 
structures present in the global minima geometries of larger clusters are not 
very different from those found in the global minima geometries of smaller 
ones. 

2.2 Preliminary implementation 

In this Letter, I present a particular, preliminary implementation of GOALS. 
In order to keep the implementation simple, several approximations to the 
general concept are introduced: 

Each cluster assembly is done initially on fee lattice sites only. The fee 
lattice was chosen simply because fee and hep are known to be the closest 
packings of hard spheres in 3D, for N — + oo |TJj (the problem of optimal 
cluster packings with N < oo is still unsolved, though). Contrary to the hep 
lattice, the fee lattice points are easily encodable via an oblique coordinate 
system. Furthermore, many other lattices are related (in a formal way) to 
the fee lattice. Note, however, that the fee lattice only serves as a temporary 
template to establish neighborhood relationships. 

As an approximation to the cluster assembly agents of section 2.1, I in- 
troduce "numerical fec-agents" . They are solely characterized by the number 
of nearest fec-neighbors the atom will have that they are about to introduce. 
For example, imagine a triangle of 3 atoms on neighboring fee lattice sites. 
The "1-agent" adds another atom such that this new atom will have exactly 
1 nearest neighbor in the resulting 4-atom structure. That is, it introduces 
a dangling atom at one corner of the triangle. Similarly, if we instead let the 
"2-agent" operate on the trianlge, we will get a flat or bent rhombus. Action 
of the "3-agent" on the triangle generates a tetrahedron. Obviously, none of 
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the other agents (4-agent through 12- agent) is applicable to a 3-atom triangle. 
Of course, these numerical fcc-agents are a very crude approximation, but 
they have the following advantages: There are only twelve possible agents, 
thus we cannot miss one inadvertently. All twelve agents are easy to code, no 
exercises in artificial intelligence or pattern recognition are necessary. Nev- 
ertheless, the number of nearest neighbors is an essential feature which is 
directly connected to sterical crowdedness vs. openness, and to chains vs. 
flat structures vs. 3D ones. Disadvantages of this choice will be discussed in 
section IV. 

In order to arrive at realistic structures, independent of the fee lattice, 
each completed cluster is subjected to a traditional, local conjugate gradient 
optimization [ r9|] . The resulting minimized potential energy is taken as char- 
acteristic for the whole cluster assembly that lead to the fee structure prior 
to the local optimization. 

In order to destroy accidental symmetries present in the fec-based ge- 
ometries, small random displacements are added to all coordinates before 
doing the local optimization step. Both the assembly and the random dis- 
placements preclude a one-to-one relationship between the parameters to be 
optimized and the cluster geometries. Possible artifacts arising from this am- 
biguity are avoided by taking the average potential energy from five different 
sequences [assembly + random displacement + local optimization], for each 
set of agent probabilities. 

The agent probabilities themselves can be optimized using any global 
optimization method; here I am using the standard genetic algorithm (GA) 
as described by Holland and Goldberg |L3|, [14| and used in Ref ||. In a 
GA, the optimization problem is mapped onto an algorithm that mimics 
natural evolution. In particular, the parameters to be optimized are called 
genes, one particular instance of a complete parameter set is the genome 
of an individuum, and many such individuals form a population. The cost 
function measures the fitness of an individuum, which in turn determines how 
many children this individuum will have in the next generation. Optimization 
proceeds through mutation, exchange of information between individuals (via 
genetic crossover), and preferential inheritance of favorable genes. In the 
present case, each individual in the GA consists of 12 values for the agent 
probabilities; each of these values is encoded as a 5-bit binary number (this 
is a compromise between short genes and a sufficiently fine grid). An elitist 
strategy is employed (the best individuum of each generation is guaranteed 
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to survive unchanged into the next), since it seems to be performing slightly 
better than the standard non-elitist version. It turns out that fairly small 
GA's are sufficient in this context: 10 generations of 10 individuals each 
already yield satisfactory results. 

A separate program takes a given cluster assembly agent probability dis- 
tribution, builds new clusters of arbitrary size (again, on the fee lattice), 
and locally optimizes them (in cartesian coordinates, independent of the fee 
lattice sites). If this whole concept works, the probability to find the global 
optimum for these new cluster sizes will be significantly larger for agent prob- 
abilities optimized for smaller clusters than for random agent probabilities 
or for completely random starting geometries. 

To summarize, the final algorithm to be applied in the next section is 
this: 

stage 1: optimize agent probabilities for an A^-atom cluster by iterating the 
following steps: 

• choose agent probability values 

• starting from the A^-atom seed, use these probabilities to build 
up 5 A^-atom clusters 

• locally optimize these 5 clusters; the resulting minimal energies 
are i — 1, . . . , 5 

• let E av = | I]f =1 Ei 

and by minimizing E av . Actually, this whole stage is done by the GA, 
but this is omitted for clarity. 

stage 2: construct optimized larger clusters by the following steps: 

• use the optimized agent probabilities resulting from stage 1 to 
build up clusters of Nl atoms from the same N s seed 

• locally optimize these clusters. 



3 Exemplary application 

The first application of the GOALS strategy focuses on small silicon clusters 
and employs the empirical potential developed by Bolding and Andersen 
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This potential is particularly good for small clusters, as opposed to many 
others; and it is fairly complicated, hence effects of a complexity similar 
to an ab initio potential can be expected. The objective of this Letter is 
not an exhaustive exploration of this potential for large iV; therefore, this 
application is limited to clusters up to N = 10, for which the most important 
minimum structures are given in Ref ||20|| . 

In converting from fee structures to cartesian coordinates (required as in- 
put for the local optimization on the Bolding- Andersen potential), I used the 
Si-Si distance of bulk silicon as distance between nearest-neighbor fee lattice 
sites. This turns out to be a very reasonable choice, since most fee structures 
change only marginally upon optimization. This observation also allays the 
suspicion that the choice of the fee lattice could impede the algorithm in this 
application, as silicon prefers the diamond lattice in the bulk crystal. 

As a first, trivial check of stage 1 of GOALS, I have taken a 3-atom seed 
(a triangle on fee lattice points), and optimized the cluster assembly agent 
probabilities for a target cluster of N = 4 atoms, i.e., just one more atom is 
added. The global minimum structure for iV = 4 on the Bolding-Andersen 
surface is a planar rhombus, hence the following outcome is to be expected 
(cf. section 2.2): The 2-agent should be clearly favored over the 1-agent and 
the 3-agent. Obviously, all other agents are irrelevant for this example. In 
fact, the best individuum after 10 generations shows the following probability 
distribution: 0.19 for the 1-agent, 1.0 for the 2-agent, and 0.0 for the 3-agent 
(for simplicity, each single probability can take on values between and 1; the 
algorithm used does not require that all probabilities sum to 1.0). This shows 
that the agent probability optimization part of GOALS works as expected. 

A more realistic example for the full GOALS strategy is the global geom- 
etry optimization of Siio- Fig. 1 summarizes the results (the numbers shown 
are no full statistics, but merely particular runs; but I have checked that they 
are representative): 

Traditional, local optimizations of starting geometries with the coordi- 
nates of all atoms chosen at random have practically no chance of finding the 
global minimum at all. In 1000 runs, the global minimum at -39.2 eV was 
not found even once; and the lowest structure found (-33 eV) was not even 
close to it. 

A direct GA-optimization of the cartesian coordinates of all atoms (not 
shown) suffers from correlations between the coordinates, that is, the problem 
representation is unsuitable. Hence, this approach has problems in finding 
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the global minimum structures of clusters with N > 6. 

The next step in sophistication is generating starting geometries for local 
optimization by the cluster assembly agent approach, but with random prob- 
abilities for the agents (that is, using only stage 2 of section 2.2). In such 
structures, all nearest neighbor distances are in a favorable range; hence, this 
amounts to a massive but intelligent restriction of the search space. There- 
fore, the chances for finding the global minimum are drastically improved to 
4%. 

Using agent probabilities optimized for smaller clusters (i.e., the full 
GOALS strategy, stages 1 and 2 of section 2.2) leads to further improve- 
ment: If agent probabilities optimized for N = 8 are used for generating 
starting geometries for optimization of the N = 10 cluster, the chances for 
finding the global minimum rise to 21%. In fact, the overall performance 
is even better than this single number suggests: The chances of hitting one 
of the two lowest stationary points are close to 40%, and it is very hard to 
generate unfavorable structures with energies higher than -32 eV. 

If, however, agent probabilities optimized for N = 5 are used, chances for 
finding the global minimum for N = 10 are with 6% only marginally better 
than with random probabilities. The reason for this is a structural transition 
from planar geometries to 3D ones, which occurs between N = 5 and N = 7 
on the Bolding-Andersen surface. Such a drastic change is a peculiarity of 
the small cluster range studied here; there are also structural transitions in 



the higher- N range 211, but they are less severe. 



4 Discussion 

It must be emphasized that the Siio example given is not trivial: Lennard- 
Jones cluster studies have completely enumerated all minima up to iV = 13 
[TL\. For N = 8, only 8 minima are found, but 57(!) minima for iV = 
10. There is no complete enumeration of all local minima available for the 
Bolding- Anderson potential, but it is likely that the situation is similar. 

The numerical fee agents are only a crude approximation to true cluster 
assembly agents envisaged in section 2.1. By construction, there are only 
12 numerical fee agents, and one can easily think of structures that are not 
covered by them, for example Cgo-like geometries. Even more aggravating is 
the fact that some agents do lead to unique structural features, but others do 
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not: For example, the 12-agent obviously has a singular meaning; similarly, 
the 1-agent generates only dangling atoms or (linear or bent) chains. The 
3-agent, however, can mean anything between a closed tetrahedron, a capped 
triangular outside face, a reconstructed surface, or even a fully planar one. 
This ambiguity is reflected in the results shown above: the probability for this 
agent shows wide variations, even when others are already well converged. 
Similarly, the 4-, 5-, and 6-agents can still lead to (planar or reconstructed) 
2D forms instead of 3D structures, albeit with less probability. This is not in 
accord with the original idea of assembly agents: Each agent should lead to 
a unique structural feature in the resulting cluster; in this way, correlations 
between agent probabilities are minimized. Therefore, one can conclude that 
there is presumably some significant correlation left between the probabilities 
for the numerical agents. 

The fact that the method works in spite of these shortcomings shows 
that it also contains important, favorable features. As mentioned in the 
introduction, these are: 

• suitable restriction of search space; 

• diminished correlation between parameters to be optimized; 

• flattening of the exponential- iV-cliff of cluster geometry optimization, 
by easy transferability of cluster assembly agent probabilities from 
small N to large N. 

It can be argued that the possibility of unforeseen structural transitions to- 
tally precludes the application of the GOALS strategy in the latter sense. 
On the other hand, while the problem of structural transitions cannot be 
denied, strategies like GOALS that transfer information from smaller clus- 
ters or cluster subregions to larger clusters are the only way to weaken the 
full impact of the exponential increase in complexity of the cluster geometry 
optimization problem. 

Further exploitation of the GOALS idea clearly calls for development of 
better cluster assembly agents. Another aim of future work in this area is 
geometry optimization on the ab initio level; to this end, methods for more 
effective use of ab initio potential energy and gradient information are being 
developed. 
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Figure caption 



Figure 1: Probability in % of finding stationary Siio structures, using sev- 
eral optimization methods. Figure lb is a blow-up of the leftmost part of 
Figure la. On the x-axis, the potential energy of the structures is given 
in eV; for clarity all structures within 1 eV are binned together. This dis- 
torts the presentation somewhat; for example, the bin at -39 eV contains 
only one structure, the global minimum. The results shown are from 100 lo- 
cal optimizations of starting geometries generated by the following methods: 
randomly chosen atomic coordinates ( "random" , x ) , fee cluster assembly us- 
ing random agent probabilities ( "random-fee" , □), fee cluster assembly using 
agent probabilities optimized for Si 5 ("opt-5", +), and fee cluster assembly 
using agent probabilities optimized for Si 8 ("opt-8", O). For further details, 
see text. 
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